Xiaomi Open Sources 309 Billion Parameter MiMo-V2-Flash Large Model, Inferencing Speed Outperforms Mainstream Competitors, API as Low as $0.1 per Million Tokens
Xiaomi releases the open-source large model MiMo-V2-Flash, which is designed for high speed and efficiency, showing outstanding performance in tasks such as inference and code generation, with response speed surpassing multiple popular domestic models. The model adopts a sparse activation architecture, with 309 billion parameters, and the weights and code are open-sourced under the MIT license.